Close

%0 Conference Proceedings
%4 sid.inpe.br/sibgrapi/2020/09.16.19.14
%2 sid.inpe.br/sibgrapi/2020/09.16.19.14.22
%@doi 10.1109/SIBGRAPI51738.2020.00050
%T 1-to-N Large Margin Classifier
%D 2020
%A Layza, Jaime Rocca,
%A Pedrini, Helio,
%A Torres, Ricardo da Silva,
%@affiliation Institute of Computing, University of Campinas, Campinas, SP, Brazil, 13083-852
%@affiliation Institute of Computing, University of Campinas, Campinas, SP, Brazil, 13083-852
%@affiliation Department of ICT and Natural Sciences, Norwegian University of Science and Technology (NTNU)
%E Musse, Soraia Raupp,
%E Cesar Junior, Roberto Marcondes,
%E Pelechano, Nuria,
%E Wang, Zhangyang (Atlas),
%B Conference on Graphics, Patterns and Images, 33 (SIBGRAPI)
%C Porto de Galinhas (virtual)
%8 7-10 Nov. 2020
%I IEEE Computer Society
%J Los Alamitos
%S Proceedings
%K Large Margin Classifier, Noise Label Data, Adversarial Attacks.
%X Cross entropy with softmax is the standard loss function for classification in neural networks. However, this function can suffer from limitations on discriminative power, lack of generalization, and propensity to overfitting. In order to address these limitations, several approaches propose to enforce a margin on the top of the neural network specifically at the softmax function. In this work, we present a novel formulation that aims to produce generalization and noise label robustness not only by imposing a margin at the top of the neural network, but also by using the entire structure of the mini-batch data. Based on the distance used for SVM to obtain maximal margin, we propose a broader distance definition called 1-to-N distance and an approximated probability function as the basis for our proposed loss function. We perform empirical experimentation on MNIST, CIFAR-10, and ImageNet32 datasets to demonstrate that our loss function has better generalization and noise label robustness properties than the traditional cross entropy method, showing improvements in the following tasks: generalization robustness, robustness in noise label data, and robustness against adversarial examples attacks.
%@language en
%3 PID6615191.pdf


Close